Partitioning Input Space for Control-Learning

نویسندگان

Dean F. Hougen

Maria Gini

James Slagle

چکیده

This paper considers the eeect of input-space partitioning on reinforcement learning for control. In many such learning systems, the input space is partitioned by the system designer. However, input-space partitioning could be learned. Our objective is to compare learned and programmed input-space partitionings in terms of the overall system learning speed and proociency achieved. We present a system for unsupervised control-learning in temporal domains with results for both programmed and learned input-space partitionings. The trailer-backing task is used as an example problem. Many classic control-learning systems, such as Michie and Chambers' BOXES 11] and Barto, Sutton, and Anderson's ASE/ACE system 2], rely on the partitioning of a space of continuous input variables into a xed number of discrete regions. More recent systems, such as Fuzzy BOXES 18], have blurred the boundaries between the input regions but nonetheless rely on a partitioning of the input space. To use these systems, researchers have typically partitioned the space manually, prior to the application of the learning system. In these cases, the designer must analyze the problem to discover a suitable partitioning, or face a trade-oo between a ne partitioning that permits accurate approximation of complex functions or a gross partitioning that allows for rapid learning. The Self-Organizing Neural Network with Eligibility Traces (SONNET) scheme introduced by Hougen 4] is a general paradigm for the construction of connectionist networks that learn to control systems with a temporal component. In order to form mappings from input parameters to output responses, the system discretizes the input space by learning a partitioning of it, and learns an output response for each resulting discrete input region. SONNET systems have separate subsystems for learning input and output. The input subsystem learns input-space partitionings through self-organization and the output subsystem learns responses through the use of eligibility traces. Both input and output learning make use of topological ordering of the neural elements and associated neighborhoods, as in Kohonen's Self-Organizing Topological Feature Maps 8]. 1.1 Topology and neighborhoods Each SONNET subsystem consists of one or more artiicial neural networks. For each network there is a topological ordering of the neural elements that remains constant as the network learns. Each neural element is assigned an integer tuple of the same dimensionality as the network that uniquely deenes its coordinates in topology space. The existence of a network topology allows for the deenition of a distance function for the neural elements. This is …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Partitioning Input Space for Reinforcement Learning for Control

This paper considers the effect of input-space partitioning on reinforcement learning for control. In many such learning systems, the input space is partitioned by the system designer. However, input-space partitioning could be learned. Our objective is to compare learned and fixed input-space partitionings in terms of the overall system learning speed and proficiency achieved. We present a sys...

متن کامل

Modular SRV Reinforcement Learning Architectures for Non-linear Control

This paper demonstrates the advantages of using a hybrid reinforcement–modular neural network architecture for non-linear control. Specifically, the method of ACTION-CRITIC reinforcement learning, modular neural networks, competitive learning and stochastic updating are combined. This provides an architecture able to both support temporal difference learning and probabilistic partitioning of th...

متن کامل

Input Space Partitioning for Neural Network Learning

Neural Network (NN) is a supervised machine learning technique, which is typically employed to solve classification problems. When solving a classification problem with the conventional NN, the input data fed into the NN often consists of multiple attributes of various properties. However, training the NN with all of the available input attributes may not lead to the desirable performance consi...

متن کامل

Neural Incremental Attribute Learning in Groups

Incremental Attribute Learning (IAL) is a feasible approach for solving high-dimensional pattern recognition problems. It gradually trains features one by one. Previous research indicated that supervised machine learning with input attribute ordering can improve classification results. Moreover, input space partitioning can also effectively reduce the interference among features. This study pro...

متن کامل

Hierarchical Neuro-Fuzzy Systems Part II

This paper describes a new class of neuro-fuzzy models, called Reinforcement Learning Hierarchical NeuroFuzzy Systems (RL-HNF). These models employ the BSP (Binary Space Partitioning) and Politree partitioning of the input space [Chrysanthou,1992] and have been developed in order to bypass traditional drawbacks of neuro-fuzzy systems: the reduced number of allowed inputs and the poor capacity t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2007

Partitioning Input Space for Control-Learning

نویسندگان

چکیده

منابع مشابه

Partitioning Input Space for Reinforcement Learning for Control

Modular SRV Reinforcement Learning Architectures for Non-linear Control

Input Space Partitioning for Neural Network Learning

Neural Incremental Attribute Learning in Groups

Hierarchical Neuro-Fuzzy Systems Part II

عنوان ژورنال:

اشتراک گذاری